Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality.

نویسندگان

  • Peter C Austin
  • Jack V Tu
چکیده

OBJECTIVES Automated variable selection methods are frequently used to determine the independent predictors of an outcome. The objective of this study was to determine the reproducibility of logistic regression models developed using automated variable selection methods. STUDY DESIGN AND SETTING An initial set of 29 candidate variables were considered for predicting mortality after acute myocardial infarction (AMI). We drew 1,000 bootstrap samples from a dataset consisting of 4,911 patients admitted to hospital with an AMI. Using each bootstrap sample, logistic regression models predicting 30-day mortality were obtained using backward elimination, forward selection, and stepwise selection. The agreement between the different model selection methods and the agreement across the 1,000 bootstrap samples were compared. RESULTS Using 1,000 bootstrap samples, backward elimination identified 940 unique models for predicting mortality. Similar results were obtained for forward and stepwise selection. Three variables were identified as independent predictors of mortality among all bootstrap samples. Over half the candidate prognostic variables were identified as independent predictors in less than half of the bootstrap samples. CONCLUSION Automated variable selection methods result in models that are unstable and not reproducible. The variables selected as independent predictors are sensitive to random fluctuations in the data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Regression trees for predicting mortality in patients with cardiovascular disease: What improvement is achieved by using ensemble-based methods?

In biomedical research, the logistic regression model is the most commonly used method for predicting the probability of a binary outcome. While many clinical researchers have expressed an enthusiasm for regression trees, this method may have limited accuracy for predicting health outcomes. We aimed to evaluate the improvement that is achieved by using ensemble-based methods, including bootstra...

متن کامل

Prognostic Impact of Thrombolysis in Myocardial Infarction Risk Index on Hospitalization Mortality of Patient with Acute Pulmonary Embolism

Introduction: Acute pulmonary embolism (PE) is one of the deadly cardiovascular diseases. One of the indexes proposed in these patients for risk stratification is the Thrombolysis in Myocardial Infarction (TIMI) risk index (TRI), which includes three parameters of systolic blood pressure, age, and heart rate. This study aimed to evaluate the predictive value of TRI on in-hospit...

متن کامل

Complement factors in Acute Myocardial Infarction and Unstable Angina

Background: Coronary artery disease (CAD) is one of the most important and lethal diseases in the world. CAD represents a board spectrum of disease from silent ischemia at one end to sudden cardiac death at the other end. The middle of this spectrum consists of acute myocardial infarction (AMI) and unstable angina pectoris (UA). Recent data show that the inflammatory process plays a major r...

متن کامل

HEART RATE: A PREDICTOR OF EARLY MORTALITY IN PATIENTS WITH MYOCARDIAL INFARCTION

A number of epidemiologic studies have reported a positive relationship between heart rate, cardiovascular disease and mortality. To examine the correlation between heart rate and mortality after acute myocardial infarction (AMI), 2147 patients hospitalized in coronary care units in Isfahan were investigated in a cross-sectional study. Their heart rate was measured according to an electroca...

متن کامل

ارتباط پلی‎مرفیسم T13254C گلیکوپروتئین VI پلاکتی با سکته حاد قلبی زودرس

Background and Aim: Myocardial infarction (MI) is a major cause of morbidity and mortality worldwide. Epidemiological studies indicate that MI results from complex interactions between long-term environmental influences, concomitant disorders, and genetic susceptibility factors. Identification of genetic risk factors, particularly in premature MI, is very important. Since thrombosis plays a cri...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of clinical epidemiology

دوره 57 11  شماره 

صفحات  -

تاریخ انتشار 2004